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METHOD AND APPARATUS FOR PROVIDING A VERTEX CACHE 
/BACKGROUND OF THE INVENTION 
Field Of The Invention 

This invention relates to computer systems, and more particularly, to 
5 methods and apparatus for providing a cache to hold data representing 
vertices of polygons being displayed by a computer graphics output 
device. 

History Of The Prior Art 

In three dimensional graphics, surfaces are typically rendered by 
10 assembling a plurality of polygons in a desired shape. The polygons are 
conventionally triangles having vertices which are defined in world space 
by three dimensional coordinates, color values, texture coordinates, fog 
values, and other values. The three dimensional world space coordinates 
are translated into screen coordinates in which horizontal and vertical 
15 values define screen position and a depth value determines how near a 
vertex is to the screen and thus whether that vertex is viewed with 
respect to other points at the same screen coordinates- The color values 
define the brightness of each of red /green /blue colors at each vertex and 
thus the color at each vertex. The texture coordinates fix each vertex on 
20 a texture map, a matrix of values stored in memory which together 

describe a pattern to be applied to the surface of the triangle to vary the 
color values in accordance with the pattern. 

The graphics accelerator utilizes the three dimensional coordinates 
received from the central processing unit to define the vertices of a 
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triangle in screen space and from those to determine the individual pixels 
describing each triangle. For each pixel of the triangle, the graphics 
accelerator carries out a series of complex manipulations to determine 
the color values, depth values, texture coordinates, and other attributes 
in two dimensional screen space. Once these attributes are determined 
for a pixel, the accelerator uses the texture coordinates to generate a 
texture value for each pixel in another complex manipulation. Finally, all 
of the screen attributes of a pixel which affect the color of that pixel are 
combined to provide final color values for the pixel; and these pixel data 
are placed with the pixel address and depth in a rendering pipeline. 

As may be appreciated, the amount of data required to define each vertex 
in an advanced three dimensional system is substantial. In one 
particular arrangement, thirty-two bytes of data are required to describe 
a single vertex. To accomplish the operations by which the individual 
pixels which describe the triangle are defined for use by some graphics 
output device, it is first necessary to transfer the data defining each 
vertex of each triangle from system memory to the graphics accelerator 
circuitry. Conventionally, data defining each individual vertex of a 
triangle is individually transferred by the central processing unit to the 
graphics accelerator over the system input/ output (I/O) bus. This 
requires that the central processing unit use the bus control circuitry to 
gain access to the system (I/O) bus in order to transfer the data defining 
each of the vertices. 

Twenty to forty bytes of data are typically required to define all of the 
attributes at each of the vertices in world space. In a computer with a 
thirty-two bit bus I/O bus, five to ten writes by the central processing 
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unit are needed to transfer data describing a single vertex. On the other 
hand, a command may require less than a byte. Consequently, to 
transfer data defining three vertices and a command to render a triangle 
requires between sixteen and thirty-one bus transfers. If each transfer 

5 requires a bus acquisition, it may take from sixteen to thirty-one bus 

acquisitions to transfer the data describing a single triangle. Even when 
burst transfers are utilized, no more than a single vertex can be 
transferred in one burst so as many as four bus acquisitions are required 
to transfer the vertex data and a command to render a single triangle. 

10 Thus, both normal and burst transfers are relatively slow processes 
constrained by bus bandwidth and require a significant amount of 
central processor time. 

It is desirable to transfer data for rendering polygons more rapidly 
between a source of graphics data and graphics accelerator circuitry in a 
15 manner which reduces the effect of bus bandwidth. 

Summary Of The Invention 

This and other desirable results of the present invention are realized by 
apparatus and a method which includes the steps of transferring data 
defining individual vertices of a polygon from a source of graphics data to 
20 a graphics accelerator, caching data defining each of the vertices in a 

cache until all vertices of a complete polygon are described, transferring 
a command to the graphics accelerator to render a polygon defined in 
terms of vertices in the cache, and rendering the polygon defined by the 
data for the vertices. 
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Once data defining all vertices of a complete polygon are in the cache, 
any polygon using those vertices may be rendered by simply transferring 
a command to render the polygon defined in terms of its vertices. Thus, 
if sufficient vertices are in the cache, a series of rendering commands 
may render a series of additional polygons without the need to transfer 
any additional vertices by simply reusing stored vertices to form different 
polygons. 
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These and other objects and features of the invention will be better 
understood by reference to the detailed description which follows taken 
together with the drawings in which like elements are referred to by like 
designations throughout the several views. 

Brief Description Of The Drawings 

Figure 1 is a block diagram describing a computer utilizing the present 
invention. 

FigureTisa~diasi^m of a cache memory which may be used in 
accordance with the presenHrrveotkm with the computer of Figure 1. 

Figure 3 is an illustration of data defining three vertices of a polygon to 
be rendered by a graphics accelerator and a command to render a 
polygon used in accordance with an embodiment of the invention. 

Figure 4 is an illustration of data defining a vertex of a polygon to be 
rendered by a graphics accelerator and a command to render a polygon 
used in accordance with another embodiment of the present invention. 
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Figure 5 is yet another illustration of data defining a vertex of a polygon 
to be rendered by a graphics accelerator in accordance with the present 
invention. 

Figure 6 is a diagram illustrating a first arrangement of polygons defining 
a shape which may be rendered by a graphics accelerator in accordance 
with the present invention. 

Figure 7 is a diagram illustrating a second arrangement of polygons 
defining another shape which may be rendered by a graphics accelerator 
in accordance with the present invention. 

Figure 8 is a diagram illustrating a third arrangement of polygons 
defining another shape which may be rendered by a graphics accelerator 
in accordance with the present invention. 

Figure 9 is a diagram illustrating the use of a hardware controlled cache 
designed in accordance with the present invention. 

Detailed Description 

Figure 1 is a block diagram illustrating a computer 10. The computer 10 
includes a central processing unit 1 1 , main memory 12, system 
input/output (I/O) bus 13, and various I/O devices 14 and 15. The I/O 
device 15 is a graphics accelerator designed in accordance with the 
present invention to speed the transfer of graphics data from the central 
processing unit 11 to the device 15. The graphics accelerator includes 
circuitry for rasterizing the data transferred, applying texture 
information, and rendering the pixel data defining the triangle to a frame 
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buffer. In addition, the device 15 includes a DMA engine 17 and a cache 
20 designed in accordance with the present invention. 

An application program drawing three dimensional shapes typically 
renders surfaces by assembling a plurality of polygons in a desired 
shape. The polygons are conventionally triangles which are defined by 
three dimensional coordinates in world space, color values, and texture 
coordinates of their vertices. Figure 6, for example, is a diagram 
illustrating the representation of a circle by a series of triangles having a 
number of vertices A-H and J. As may be seen, the triangles which 
represent the circle have a number of vertices which are identical from 
triangle to triangle. The vertex J, for instance, appears in each of the 
triangles defining the circle. 

Figure 7 illustrates a rectangular shape defined by a series of triangles 
referred to as a "strip." In the strip of Figure 7, each vertex (other than 
those at the corners of the rectangle) such as L is shared by three 
different triangles. Figure 8 illustrates a larger rectangular shape defined 
by an arrangement of triangles referred to as a "mesh." As may be seen, 
internal vertices such as M in the mesh configuration are shared by six 
different triangles. In fact, in a large mesh, the average number of 
triangles which use each vertex is six. 

Conventionally, an application program executing on the central 
processing unit 1 1 renders each triangle of a shape by transferring data 
defining the three vertices of the individual triangle followed by a 
rendering command for that triangle from system memory to the 
graphics accelerator 15. This requires that the central processing unit 
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1 1 gain access to the bus as described above in order to transfer the 
vertex data for each triangle in the shape. 

Figure 3 is an illustration of the data required to be transferred over the 
system bus to the graphics accelerator in order to define three exemplary 



5 vertices A, B, and J and a command to render a triangle ABJ in forming 
the circle of Figure 6. In Figure 3, each box represents four bytes of 
data. As may be seen, four bytes are required to represent each attribute 
x, y, z, u, and v while less than four bytes are required to represent each 
attribute r, g, and b. Similarly, representing the fog attribute (f) requires 

io .less than four bytes. In contrast to the amount of data required to 

represent a vertex, the command to render the triangle ABJ requires, in 
one embodiment, only four bits of data. Because of the amounts of data 
to be transferred, the transfer operation for pixel data defining one vertex 
of a triangle typically requires as many as ten bus accesses to 

15 accomplish. 

It should be noted that the command format allows a "no-op" command 
to be implemented by naming all three vertices identically (e.g., JJJ); 
since such a command describes a point, no triangle is drawn, and the 
effect is a "no-op" command. 

20 Thus, to render each triangle, the central processing unit 1 1 has to 

acquire access to the system I/O bus 13 as few as ten and as many as 
thirty-one times to transfer the vertex data and a rendering command to 
the graphics accelerator 15. Historically, this has not slowed operations 
appreciably since graphics accelerators have not functioned very rapidly. 

25 However, as graphics accelerators have become faster, central processing 
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units have been unable because of the limited bus bandwidth to supply 
sufficient data to a graphics accelerator to keep the accelerator operating 
at full speed. 

The operation of the computer may be accelerated by a new process by 
which an application program establishes a large transfer buffer in 
memory in which data describing a very large number of vertices may be 
stored. The application program commands a direct memory access 
(DMA) engine which may be positioned with the graphics accelerator 15 
to transfer from the transfer buffer to the graphics accelerator the vertex 
data needed to render a polygon. The application program commands 
the graphics accelerator to render the triangle utilizing the vertex data 
transferred by the DMA engine. Even though the use of a DMA engine 
reduces the use of the central processing unit and thereby accelerates 
operations of the computer as a whole, the same bus bandwidth 
restrictions apply to transfers by the DMA engine so the transfer process 
itself is not faster. 

The present invention reduces the need for bus bandwidth and allows 
triangles to be rendered at a speed at which the graphics accelerator is 
designed to operate. In order to accomplish this, the present invention 
utilizes a cache 20 which is a part of the graphics accelerating device 15 
to store vertices. The cache 20 stores vertices of polygons as they are 
transferred to the device 15 either by a DMA engine or the central 
processing unit. The vertices are then held in the cache 20 until 
replaced. Once at least three vertices of a first triangle has been 
transferred to the graphics accelerating device 15 and stored in the cache 
20, any vertex data in the cache may be used and reused to render any 



8 



NV29 



± O IMS 7 3 3 . .1 0 2 5 1 




triangles defined by those vertices. Since a typical shape rendered on the 
output display (e.g., the shape illustrated in Figure 6) is assembled from 
a plurality of triangles many of which have vertices which are vertices of 
other triangles, if vertices of one triangle are in the cache and a first 
triangle has been rendered using these vertices, it requires only the 
addition of data defining a single additional vertex to allow the rendering 
of a second triangle. Thus, a first triangle (for example, triangle ABJ in 
Figure 6) may be rendered by the transfer of the three vertices A, B, and 
J and a rendering command. Then, a triangle adjoining the first triangle 
ABJ and having two of the vertices of the first triangle (such as the 
triangle BCJ) may be rendered by the transfer of a single additional 
vertex (C) to the cache 20 followed by a command to the graphics 
accelerator 15 to render the triangle BCJ. Thus, rather than requiring 
three vertices and a render command to render an additional triangle 
after the first triangle of an interrelated group of triangles has been 
rendered, only a single additional vertex and a single render command 
need be transferred over the system bus. Since the amount of data 
defining a vertex varies from twenty to forty bytes, this may reduce the 
amount of data to be transferred in order to generate a second triangle 
by up to eighty bytes. This is a significant reduction in the data which 
needs to be transferred in order to render any triangle which utilizes 
vertex data already in the cache 20. In this simple case, the process 
divides the time required to transfer data across the system bus almost 
in half. 

luPigmuST aP embedment of the cache 20 is illustrated in which data 
defining each vertex may be stored~by-the application program executing 
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^the central processing unit 1 1. A particular cache 20 includes 
position for data defining sixteen individual vertices in one embodiment 
and may inchade space for a tag. A position in a cache storing sixteen 
vertices may be selected by the use of only four tag bits. By selecting the 
particular cache positioh^in which the vertices reside, the application 
may utilize vertex data already>§tored in the cache to by simply 
transferring commands designaungTfcevertices to be used in any 
triangle to be rendered. 

The vertices may be stored in the cache 20 in a number of different ways. 
For example, the storage space in the cache may be memory mapped so 
that an application directs data describing a particular vertex to a 
particular memory address and recalls the data from that memory 
address when it is to be used in describing a triangle. The data 
describing a particular vertex may be placed in the cache 20 under 
software control. This allows a cache position to be designated by an 
index which requires significantly less data to describe than does an 
address. Thus, the application program may select cache positions for 
storage of vertex data which may be designated in the manner in which 
the vertices are numbered in Figure 6, for example. A command to 
render a triangle may designate a triangle by these same indexed 
representations and thereby require the transfer of less command data 
over the bus. 

Software control over the storage of the data in the cache 20 may be 
accomplished by an application program which provides the means to 
generate a list of indexed positions in the cache 20 and also provides a 
replacement policy for the data in the cache 20. This allows the 
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application to designate the vertices making up a triangle to be rendered 
in commands in accordance with an indexing designation used for the 
storage space. In a typical case, such a designation requires four bits of 
data space to implement a sixteen entry cache. In such an embodiment 

5 of the invention, the central processing unit store vertices in the cache 
20 in storage spaces which the application program designates. Then, 
the central processing unit designates a triangle to be rendered by simply 
designating the indices of three vertices already stored in the cache 20 
using a single command to the graphics accelerator which directs that 

10 the rendering process commence. 

Software control over the storage of the data in the cache 20 may also be 
accomplished in a similar manner by a software driver for the graphics 
accelerator which is equipped to generate a list of cache positions and to 
execute a replacement policy for data in the cache. 

15 The command data itself may be transferred from an application program 
to the graphics accelerator in a number of ways. An application may 
cause the central processing unit to transfer a command via the system 
I/O bus to the graphics accelerator for execution after vertices have been 
placed in the cache. Such a command may be addressed to a particular 

20 register on the graphics accelerator so that the receipt of vertex 

identifications at the register indicates the command intended. In a 
system utilizing a DMA engine to transfer vertex data, a command may 
be included within the data defining a particular vertex. For example, 
the data needed to represent fog (f) and the data needed to represent 

25 r/g/b values each requires less than the four bytes allocated to it in the 
vertex data stream. The space not used by the fog attribute or the r/ g/b 
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attributes in data defining a vertex may be utilized to transfer a 
command to render a triangle for which all of the vertices are in the 
cache or are included in the particular transfer of vertex data which 
includes the command to render. Such an arrangement of data is 

5 illustrated in Figure 4. This reduces the need to transfer commands 

separately and eliminates any bus access to assist the transfer of such a 
command. Once any two vertices of linking triangles have been 
transferred to the cache 20 using such a transfer method, it is only 
necessary to transfer data defining one additional vertex in order to 

10 commence the rendering of that triangle since the command for 

rendering may be included with the data defining the last vertex . In 
fact, it is possible to embed commands to render more than one triangle 
in the data defining a vertex and thereby lower the amount of data 
transferred even more. 

15 Another manner of transferring commands to the graphics accelerator by 
a DMA engine is to add some data space to the format in which the data 
defining a vertex is transferred. For example, Figure 5 illustrates a 
format in which an additional four bit space is added to the left of the 
vertex definition data. Similarly, command data might be added at the 

20 right or any other position in the format. Using such a format requires 
more data transfer time than does the last described embodiment but is 
quite simple and easy to implement. 

Another modification of the invention allows a series of triangles to be 
generated once vertices have been transferred for those triangles to a 
25 cache 20 in the various manners described above. For example, when 
the shape to be described is a strip such as is shown in Figure 7, all of 
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the vertices except those at the corners of the rectangle are used and 
reused in three different triangles. If only sixteen vertices have been 
transferred to the cache 20, the entire shape shown in Figure 7 may be 
described by sending a series of fourteen commands to render the 
fourteen triangles which may be defined by the sixteen vertices. A 
sequence of twelve bits of command data is sufficient to transfer a 
command to render any single one of the fourteen triangles since four 
bits will identify each vertex in the cache 20. Those skilled in the art will 
appreciate the reduction in data transfer for both vertices and commands 
which this allows. An even greater advantage is obtained in describing a 
shape such as the mesh of Figure 8. If fifteen vertices are placed in the 
cache 20 and a total of sixteen rendering commands are transferred, 
sixteen triangles are described for an output device. 

Similar techniques as those described above for transferring commands 
may also be utilized for transferring data indicating the particular vertex 
which is being designated by a sequence of data being transferred. That 
is, data transferred by an application to a particular register may 
designate data to be stored in a particular cache position. In a data 
transfer using a DMA engine, the particular vertex may be designated by 
data indicated by an index value added to the format or included within 
excess space provided for attributes which do not use the entire space 
allotted by the format. 

One considerable advantage of the present invention is that the cache 20 
may be utilized to draw other than the typical triangles used to describe 
shapes. By defining a number of vertices, the shape of any particular 
quadrilateral may be defined. Thus, polylines (lines with fill of various 
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widths) may be drawn using the cache 20 and describing vertices which 
describe segments of the line. 

An additional embodiment of the invention utilizes a hardware 
arrangement to manipulate the data stored in the cache 20. In order to 
accomplish such an arrangement, vertex data defining a large plurality of 
vertices to be displayed in stored in a portion of memory by the 
application program. For example, an application might place vertex 
data at any of the memory mapped positions in a memory array of 64K 
vertices. Each entry in the cache 20 carries a sixteen bit tag. A 
command initiated by an application includes three vertices each 
described by sixteen bits which are sufficient to access any vertex stored 
in the 64K region of memory. The command may be transferred to the 
graphics accelerator either by the central processing unit or a DMA 
engine on the graphics accelerator in the manner explained above. When 
the command is received by the graphics accelerator, the hardware cache 
controller of the accelerator checks the cache 20 to determine whether all 
of the vertex data exists in the cache 20. If the vertex data exists in the 
cache, the triangle may be rendered immediately. If any of the vertex 
data is not in the cache 20, the accelerator causes the DMA engine to 
read the vertex data in the region maintained by the application in 
memory and write the data to the cache 20. With the vertex data in the 
cache, the triangle is rendered. It should be noted that in such an 
arrangement, vertex data is not written across the system bus by the 
central processing unit. The vertex data is always read from memory by 
the DMA engine on the graphics accelerator. In order to function with 
the cache 20, the hardware control for the cache 20 may utilize a state 
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machine to implement an algorithm for replacing data in the cache where 
new data is required to draw triangles. 

Although the present invention has been described in terms of a 
preferred embodiment, it will be appreciated that various modifications 
5 and alterations might be made by those skilled in the art without 
departing from the spirit and scope of the invention. The invention 
should therefore be measured in terms of the claims which follow. 

What Is Claimed Is: 
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CLAIMS AS AMENDED 



1 V (Amended) A computer comprising 

2 \ a central processing unit; 

3 \abus; 

4 mfemory; and 

5 a graphics accelerator coupled to the bus and including a cache for data associated with 

6 vertices, the dafca for each of the vertices representing its respective attributes including its screen 

7 coordinates, colorVid a fog attribute, the data of any of the vertices that is already resident in the 

8 cache can be used anchreused to render any polygons defined by the vertices. 



1 2. (Amended) A compiler as claimed in Claim 1 in which the cache includes cache 

2 positions for the data associated^dth the vertices the cache positions being indexed by an 

3 application program. 



1 3. (Amended) A computer as claimed iim^laim 1 in which the cache has a memory mapped 

2 storage space for the data associated with the vertices of polygons. 



1 4. (Amended) A computer as claimed in Clain\l in which the cache includes cache 

2 positions for the data associated with the vertices the ctehe positions being indexed by an 

3 address in a data structure of vertices in memory. 
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1 5^ (Amended) A computer as claimed in Claim 1 further comprising a direct memory 

2 access (DMA) engine for transferring the data to the cache. 

1 6. (Amended) A method for rendering polygons comprising: 

2 provictaig indices for designating cache positions of data in a cache, the data being 

3 associated with Vertices of any of the polygons, for each of the vertices the data defines its 

4 respective attributeWcluding its screen coordinates, color and a fog attribute; 

5 storing the datk at the designated cache positions in the cache based on each of the 

6 indices being provided, tnfe data for any of the vertices that is already resident in the cache can be 

7 used and reused to render anyyn&ygons defined by the vertices; and 

8 issuing a command fcfrrmdering a polygon by indicating the indices for selecting the 

9 data associated with the vertices onhe selected polygon. 

1 7. (Amended) A method as claimed iriyClaim 6 in which the step of issuing the command 

2 further includes directing a central processing unit to transfer the command to a register. 

1 8. (Amended) A method as claimed in Claim 6 inWhich the step of issuing the command 

2 further includes embedding the command in the data associated with the polygon. 

1 9. (Amended) A method as claimed in Claim 6 in which the rfto of providing the indices 

2 further includes \ 

17 
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1 1 

3 \creating an array of vertices in a memory, 

4 indexing data for each of the vertices which is stored in the array, 

5 selecting from the array vertices defining a polygon to be rendered, and 

6 transferring to the cache the data for each of the selected vertices. 

1 10. (Amended) A mbthod as claimed in Claim 6 further comprising transferring the data to 

2 the cache by commanding direct memory access (DMA) engine to transfer the data from an 

3 array of vertices in a memoA 

1 11. (Amended) A graphics accrfSrator comprising: 

2 setup circuitry responding t(K data defining attributes of vertices of polygons for 

3 determining attributes of pixels to be displayed, the attributes of each of the vertices including 

4 screen coordinates, color and a fog attribute^nd 

5 a cache circuit for storing the data, the data ofainy of the vertices that is already resident in the 

6 cache circuit can be used and reused to render an^olygons defined by the vertices. 

1 12. (Amended) A graphics accelerator as claimed lk Claim 11 further comprising a direct 

2 memory access (DMA) engine for transferring the data to the cache circuit in response to 

3 commands from an application program. \ 
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Abstract of the Disclosure : 

A method for caching data defining vertices of a polygon to be displayed 
by an input/ output display device including the steps of providing an 
index for a vertex for which data is to be cached, storing data defining 
5 attributes of a polygon at a vertex in a cache under the index provided, 
issuing a command signifying a polygon to be manipulated by indicating 
indices of vertices of the polygon for which data is cached. 
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CERTIFICATE OF MAILING (37 CFR 1.10(A)) 

CERTIFICATE OF MAILING BY "EXPRESS MAIL" - Rule 10: 1 hereby certify that this correspondence is being deposited with 
the U. S. Postal Service "Express Mail Post Office to Addressee" under 37 CFR 1.10 as Express Mail No. EL6S48g2?26US , 
addressed to Box New Patent Application, Assistant Commissioner for Patents, Washington, D.C. 20231 on October 25, 2000 

Date: October 25, 2000 Km, __^I^l5%Zi 



In Re Application of: ~ Examiner: Unassigned 

Priemetal. RECEIVED 
Serial No.: Unassigned Art Unit: ^ 3 j 2Q02 

Filed: October 25, 2000 

For: METHOD AND APPARATUS FOR PROVIDING A VERTEX CACHE OPLA 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

PRFI TMTNARV AMENDMENT 

Sir: 

In conjunction with the concurrent filing of a Continuing Application, and in 
response to the Office Action dated April 25, 2000, the period for response to which is 
extended, by the petition for extension of time, through October 25, 2000, please amend 
the application and consider the remarks as follows: 

In the Specification: 

PREVIOUS AMENDMENTS 

The specification as originally filed in the above-referenced Application is attached 
herewith. However, in the amendment mailed March 17, 2000, the specification has been 
amended as hereafter outlined and these amendments should now be incorporated into the 
original specification as follows: 

On page 4, line 15, replace 'Tigure 2 is a" with --Figures 2A and 2B~; and delete 
"of after "diagram". 
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On page 8, line 20, insert immediately after "20" and before "which" ~, as shown 
in Figures 2A and 2B ,". 

On page 9, line 25, replace "Figure 2," with -Figures 2A and 2B,~. 
On page 10, line 6, delete "to" between "cache" and "by"- 

5 On page 14, line 4, insert immediately after "20 and before the "." -, as shown in 

Figure 9-. 

NFW AMENDMENTS 

Please further implement in the specification the following new amendments: 

On page 1, line 2, before "BACKGROUND" insert a new paragraph as follows: 

io ~B EEEREHCE to BELATED APPLICATION 

This Application is a continuing Application, and claims the benefit of U.S. 
Application No. 09/056,386, filed on April 7, 1998, and further claims the benefit of U.S. 
Application 09/053, 998, filed on April 2, 1998.-. 



On page 4, line 8, before "These" insert a paragraph as follows: 



15 -In one embodiment, the graphics accelerator may be adapted to produce screen 

values for the respective attributes using a texture engine, a scaling circuit, a setup circuit, 
and a lighting pipeline.- 



On page 4, line 15, before replace "Figure 2" with: 



-Figure 2A illustrates an exemplary graphics accelerator circuit which may be used in 
20 accordance with the present invention with the computer of Figure 1 . 



0 



Figure 2B --. 
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On page 9, line 25, replace "In Figure 2," with: 

-Figure 2A illustrates an exemplary graphics accelerator circuit which may be 
used in accordance with the present invention with the computer of Figure 1. In general, 
the circuit 15 includes a setup engine 21 which receives input values for the various 
5 attributes defining the three vertices of any triangular surface being depicted. Before 
transfer to the setup engine 21, the x, y, u, and v values representing the vertices of the 
triangle are processed by a scaling circuit 24. The scaling circuit 24 receives the x, y, u, 
and v values representing the vertices of the triangle and utilizes the maximum and 
minimum values of each to determine a proper scale for a texture map. The x, y, and z 
10 coordinates may be provided by an application program as either screen space or world 
space values. The change from world space to screen space is based on a perspective 
transformation process. The scaling circuit 24 may also determine the pixels within a 
triangle from the x and y screen values of the vertices and generate specific screen x and 
y values for these pixels. The setup circuit 21 utilizes the x, y, and z screen values of the 
15 vertices to determine screen values for each of the attributes of each pixel in the triangle. 
The process of computing perspective-correct screen values for the attributes from world 
space vertex values can expressed by a geometric relationship. Gating circuitry which 
carries out addition, subtraction, multiplication, and division steps can be employed to 
produce perspective correct screen values for each of the attributes at each pixel position. 
20 The texture coordinates provided by the setup circuit 21 are transferred to a texture 
engine 22. The texture engine 22 utilizes those input values to determine texture 
coordinates at a plurality of positions within each pixel in accordance with the foregoing 
discussion of the invention. For example, the texture engine 22 may translate the texture 
coordinates at each of a plurality of positions within a pixel into texture values at those 
25 positions and blend the texture values to realize a final texture value for each pixel. This 
final texture value is transferred to a lighting pipeline 23 where the texture value and the 
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other various attributes from the setup circuit 21 are utilized to modify the color value of 
the particular pixel in the sequence of pixels utilized to describe the triangle. From the 
lighting pipeline, the pixels are transferred to the rendering pipeline. 



1 In the Claims; 

2 Please canc^ldaims 1-12 without prejudice and add new claims 13-24 as follows: 



ft 



3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
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2 
3 
4 
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13. ^. computer comprising 
a Central processing unit; 
abusV 
memor 

a graphicsVcelerator coupled to the bus including a cache for data associated with 
vertices, the data forbach of the vertices representing its respective attributes including its screen 
coordinates, color and aSfog attribute, the data of any of the vertices that is already resident in the 
cache can be used and reused to render any polygons defined by the vertices; and 

a circuit producing scteen values for the respective attributes, the circuit including a 
texture engine, a scaling circuit^ setup circuit, and a lighting pipeline. 

14. A computer as in claini^liifiVhich the cache includes cache positions for the data 
associated with the vertices the cache petitions being indexed by an application program. 

15. A computer as in claim 1 3 in which th^cache has a memory mapped storage space for the 
data associated with the vertices of polygons. 



1 16. A computer as in claim 1 3 in which the cacheNmcludes cache positions for the data 

2 associated with the vertices the cache positions being indexed by an address in a data structure of 

3 vertices in memory. 
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1 17. A dtomputer as in claim 1 3 further comprising a direct memory access (DMA) engine for 

2 transferring the data to the cache. 

1 18. A method foVrendering polygons, comprising: 

2 providing indicts for designating cache positions of data in a cache, the data being 

3 associated with vertices df any of the polygons and defining for each of the vertices its respective 

4 attributes including its screen coordinates, color and a fog attribute; 

5 storing the data at theVesignated cache positions in the cache based on each of the indices 

6 being provided, the data for an\of the vertices that is already resident in the cache can be used 

7 and reused to render any polygons* defined by the vertices; 

8 issuing a command for rendering a polygon by indicating the indices for selecting the 

9 data associated with the vertices of thkrendered polygon; and 

10 producing in a circuit screen valW for attributes respective to the vertices of the rendered 

1 1 polygon, the circuit including a texture pUe, a scaling circuit, a setup circuit, and a lighting 

12 pipeline. 

1 19. A method as in claim 1 8 in which the sttro of issuing the command further includes 

2 directing a central processing unit to transfer the Wnmand to a register. 

3 20. A method as in claim 1 8 in which the step onssuing the command further includes 

4 embedding the command in the data associated with thV polygon. 

1 21. A method as in claim 1 8 in which the step of providing the indices further includes 

2 creating an array of vertices in a memory, \ 

3 indexing data for each of the vertices which is stVed in the array, 

4 selecting from the array vertices defining a polygola to be rendered, and 

5 transferring to the cache the data for each of the selected vertices. 
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22. A method as in claim 18 further comprising transferring the data to the cache by 
commanding a\ect memory access (DMA) engine to transfer the data from an array of vertices 



in a memory. 



l 
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4 
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23. A graphics acceleratbrcomprising: 

setup circuitry responding to data defining attributes of vertices of polygons for 
determining attributes of pixels toVdisplayed, the attributes of each of the vertices including 
screen coordinates, color and a fog Mfcute, the setup circuit communicating with a texture 
engine, a scaling circuit and a lighting-pipeline; and 

a cache circuit for storing the data, th\data of any of the vertices that is already resident 
in the cache circuit can be used and reused to render any polygons defined by the vertices. 



8 
9 

10 



24. A graphics accelerator as in claim 23 further Comprising a direct memory access (DMA) 
engine for transferring the data to the cache circuit in reWse to commands from an application 



program. 
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REMARKS 



THE CLAIMS 



10 



15 



Claims 1-12, have been canceled. New claims 13-24 have been added and are 
presently pending in the Application. . 

THF SPECIFICATION 

The specification has been amended to remove informalities therefrom and render 
the specification consistent with the drawings. The specification is now believed to be 
clear and concise. 

THE DRAWINGS 

A request to amend the drawings in accordance with 37 C.F.R. § 1.12(b)(3) is 
attached herewith. The request is seeking approval of Fig. 2A and a proposed change to 
the original Fig. 2. A red-marked copy of Fig. 2A and Fig. 2B are furnished herewith 
with the request. 

CONCLUSION 

In view of the foregoing amendments and remarks, it is respectfully submitted 
that the application is now in condition for allowance. Accordingly, a Notice of 
Allowance of claims 13-24, is respectfully requested. 

If for any reason an insufficient fee has been paid, the Commissioner is hereby 
authorized to charge any deficiency in payment of required fees associated with this 
communication to Deposit Account 02-3964. 

Date: October 25, 2000 Respectfully submitted, 



Oppenheimer Wolff & Donnelly LLP 
1400 Page Mill Road 
Palo Alto, CA 94304 




By: Leah Sherry, 
Attorney for Applicant 
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Assistant Commissioner for Patents f4AY 3 1 2002 
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Dear Sir: 

Please address all further correspondence in the above-identified application to: 

Leah Sherry 
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In Re Application of: 

Priem et al. 
Serial No.: 09/056,386 
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For: METHOD AND APPARATUS 



Attn: Ms Eugenia Jones 
Office of Petitions 
Crystal Plaza 4 Room C323 
2201 South Clark Place 
Arlington, VA 22202 

Petition Under 37 C.F.R. 1.10(e) 

Commissioner: 

Applicants respectfully request that a Filing Date of October 25, 2000 be given to the attached 
papers which were in fact received in the USPTO on that date, as evidenced by the attached copy 
of the Post Card duly stamped by the PTO on that date, as well as the attached copy of an 
Express Mail Receipt likewise bearing that date. The following facts are alleged in support of 
this Petition: 

(1) That the subject application papers were in fact duly mailed to the USPTO by Express Mail 
on October 25, 2000. 

(2) That on information and belief, it is believed that a second set of at least some of the papers 
were filed on March 27, 2001 pursuant to an Examiner's request. 

(3) That on April 23, 2001, in response to an Examiner's request, a copy of all of the attached 
papers, and further including a copy of a relevant Assignment, were faxed to Examiner K. 
Tung, (USPTO Rm #6803) at 703 308-9051. See attached copy of Fax Cover Letter. 

(4) That this Petition is being filed promptly after having been requested by Examiner Mark 
Zimmerman and Ms Eugenia Jones. 

(5) That the number of the "Express Mail" mailing label was affixed to the papers. See 
documents entitled "CONTINUING APPLICATION TRANSMITTAL" and 
"PRELIMINARY AMENDMENT". 

(6) That the attached documents are a true copy of the documents filed on October 25, 2000 (and 
subsequently provided to the USPTO on March 27, 2001), and by fax on April 23, 2001 . See 
list on Fax cover sheet. 
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(7) That no fees are required for this Petition. However, should any fees be required, the 
Commissioner is hereby authorized to charge deposit account 02-3964. A duplicate of this 
Petition is attached. 



Date: May 30, 2002 

Oppenheimer Wolff & Donnelly LLP 
Customer 25696 
P.O. Box 10356 
Palo Alto, CA 94303 
Telephone: 650-320-4000 




Reg. No. 22,586 



CERTIFICATE OF MAILING 

CERTIFICATE OF MAILING BY "FedEx I hereby certify that this correspondence is being sent by FedEx, USA Airbill Number 8306 2380 
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Att^fc Docket No.: 18659.15 



IN THE PATENT AND TRADEMARK OFFICE 



Examiner: unassigned 



In Re Application of: 

Prion et al. 
Serial No.: unassigned 
Filed: October 25, 2000 
For: METHOD AND APPARATUS FOR PROVIDING A VERTEX CACHE OPLA 



Art Unit: 



RECEIVED 
MAY 3 1 2002 



Assistant Commissioner for Patents 
Washington, D.C 20231 



WF QTTF.ST TO AMFND THE DRAWINGS 



Sir: 



In accordance with 37 C.F.R. § 1 .12(b)(3), Applicants respectfully submit Fig. 2A and 
propose an amendment to original Fig. 2 of the drawings to render the drawing more clear and 
consistent with the specification. The request is seeking approval of a proposed change to 
original Fig. 2. A red-marked copy of Fig. 2A, and Fig. 2B are furnished herewith with the 
request. No new matter is introduced by the proposed amendment to the drawings. 



Date: October 25, 2000 

Oppenheimer Wolff & Donnelly LLP 
1400 Page Mill Road 
Palo Alto, CA 94304 



Respectfully submitted, 

By: Leah Sherry, 
Attorney for Applicant 
Reg. No. 43,918 
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